Tibetan |
|
---|---|
Type | Abugida |
Languages | Tibetan Dzongkha Ladakhi Sikkimese Balti |
Time period | c. 650–present |
Parent systems | Proto-Sinaitic [a] |
Child systems | Limbu Lepcha Phagspa |
ISO 15924 | Tibt, 330 |
Direction | Left-to-right |
Unicode alias | Tibetan |
Unicode range | U+0F00–U+0FFF |
[a] The Semitic origin of the Brahmic scripts is not universally agreed upon. |
The Tibetan alphabet is an abugida of Indic origin used to write the Tibetan language as well as Dzongkha, Sikkimese (Denzongkha), Ladakhi, and sometimes Balti. The printed form of the alphabet is called uchen script (Tibetan: དབུ་ཅན་, Wylie: dbu-can; "with a head"), while the hand-written cursive form used in everyday writing is called umê (Tibetan: དབུ་མེད་, Wylie: dbu-med; "headless"). The alphabet is closely linked to a broad ethnic Tibetan identity. Besides Tibet, it has also been used for Tibetan languages in Bhutan, India, Nepal, and Pakistan.[1] The Tibetan alphabet is ancestral to the Limbu alphabet, the Lepcha alphabet,[2] and the multilingual 'Phags-pa script.[2]
The Tibetan alphabet is romanized in a variety of ways.[3] This article employs the Wylie transliteration system.
The creation of the Tibetan alphabet is attributed to Thonmi Sambhota, who lived in the mid-7th century. Tradition holds that Thonmi Sambhota, a minister of Songtsen Gampo (569–649), was sent to India to study the art of writing, and upon his return introduced the alphabet. The form of the letters is based on an Indic alphabet of that period.[4]
Three orthographic standardizations were developed. The most important, an official orthography intended to facilitate the translation of Buddhist scriptures, emerged during the early 9th century. Standard orthography has not changed since then, while the spoken language has, for example by losing complex consonant clusters. As a result, in all modern Tibetan dialects, and in particular in the Standard Tibetan of Lhasa, there is a great divergence between spelling (which reflects 9th-century spoken Tibetan) and pronunciation. This divergence is the basis of an argument in favour of spelling reform, to write Tibetan "as it is pronounced", for example, writing "Kagyu" instead of "Bka'-rgyud". In contrast, the pronunciation of the Balti, Ladakhi and Burig languages adheres more closely to the archaic spelling.
The Tibetan alphabet has 30 consonants, sometimes known as radicals, which are the basis of the script.[2]
ཀ ka | ཁ kha | ག ga | ང nga |
ཅ ca | ཆ cha | ཇ ja | ཉ nya |
ཏ ta | ཐ tha | ད da | ན na |
པ pa | ཕ pha | བ ba | མ ma |
ཙ tsa | ཚ tsha | ཛ dza | ཝ wa (not originally part of the alphabet)[5] |
ཞ zha [6] | ཟ za | འ 'a [7] | |
ཡ ya | ར ra | ལ la | |
ཤ sha [6] | ས sa | ཧ ha [8] | |
ཨ a |
As in other Indic scripts, each consonant letter carries an inherent /a/. A distinctive feature of the Tibetan script is that consonants can be written either as radicals or in other positions, such as superscripts and subscripts. The superscript position above a radical is reserved for the consonants r, l, and s, while the subscript position under a radical is for the consonants y, r, l, and w. For example, the radical "ka" appears in both "kra" and "rka": when the r falls between the consonant and the vowel, it is added as a subscript; when the r precedes the consonant, it is added as a superscript.[2] The letter r changes form when it is written above most other consonants; thus རྐ rka. An exception is the cluster རྙ rnya, in which it keeps its full form. Similarly, the consonants w, r, and y change form when they are beneath other consonants; thus ཀྭ kwa; ཀྲ kra; ཀྱ kya.
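The stacking described above can be sketched in Python. This is a minimal illustration that assumes the Unicode encoding model, in which a stack is stored as a full-form head letter followed by letters from the "subjoined" range (U+0F90–U+0FBC), each sitting 0x50 above its base letter. The helper name `stack` is hypothetical and not part of any library.

```python
import unicodedata

# Offset between a Tibetan base letter (U+0F40..U+0F6C) and its
# subjoined (stacked-below) counterpart (U+0F90..U+0FBC).
SUBJOINED_OFFSET = 0x50

KA, RA, YA, WA = "\u0f40", "\u0f62", "\u0f61", "\u0f5d"


def stack(*letters: str) -> str:
    """Stack letters top to bottom (hypothetical helper).

    The first letter keeps its full form; every following letter is
    replaced by its subjoined form.
    """
    head, *below = letters
    return head + "".join(chr(ord(c) + SUBJOINED_OFFSET) for c in below)


rka = stack(RA, KA)  # r above the radical k (superscript r)
kra = stack(KA, RA)  # r below the radical k (subscript r)
kya = stack(KA, YA)  # subscript y
kwa = stack(KA, WA)  # subscript w

for label, text in [("rka", rka), ("kra", kra), ("kya", kya), ("kwa", kwa)]:
    print(label, text, [unicodedata.name(c) for c in text])
```

In this model the superscript is simply the first letter of the sequence; its reduced shape above the radical is chosen by the font, not by a separate code point.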
Besides being written as subscripts and superscripts, some consonants can also be placed in prescript, postscript, or post-postscript positions. The consonants g, d, b, m, and ’a ("’a chung") can be used in the prescript position to the left of a radical, while the position after a radical (the postscript position) can be held by the ten consonants g, ng, d, n, b, m, ’a, r, l, and s. The third position, the post-postscript position, is reserved for the consonants d and s.[2]
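The positional rules can be expressed as a small lookup, sketched here in Python with letters given in Wylie (where ng is the velar nasal). The function name `slots_ok` is hypothetical, and the check is deliberately shallow: it tests only whether each letter is permitted in its position, not whether that letter may combine with a particular radical, which traditional orthography restricts further. The requirement that a post-postscript follow a postscript is an assumption drawn from the ordering of the positions.

```python
from typing import Optional

# Letters allowed in each position, in Wylie transliteration.
PRESCRIPTS = {"g", "d", "b", "m", "'a"}
SUPERSCRIPTS = {"r", "l", "s"}
SUBSCRIPTS = {"y", "r", "l", "w"}
POSTSCRIPTS = {"g", "ng", "d", "n", "b", "m", "'a", "r", "l", "s"}
POST_POSTSCRIPTS = {"d", "s"}


def slots_ok(
    prescript: Optional[str] = None,
    superscript: Optional[str] = None,
    subscript: Optional[str] = None,
    postscript: Optional[str] = None,
    post_postscript: Optional[str] = None,
) -> bool:
    """Return True if every filled position holds a permitted letter."""
    # Assumption: the third position is only meaningful after a postscript.
    if post_postscript is not None and postscript is None:
        return False
    checks = [
        (prescript, PRESCRIPTS),
        (superscript, SUPERSCRIPTS),
        (subscript, SUBSCRIPTS),
        (postscript, POSTSCRIPTS),
        (post_postscript, POST_POSTSCRIPTS),
    ]
    return all(value is None or value in allowed for value, allowed in checks)


# "bsgrubs": prescript b, superscript s, radical g, subscript r,
# vowel u, postscript b, post-postscript s.
print(slots_ok(prescript="b", superscript="s", subscript="r",
               postscript="b", post_postscript="s"))
print(slots_ok(prescript="k"))  # k is not a permitted prescript
```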
The vowels used in the alphabet are a, i, u, e, and o. While the vowel a is inherent in each consonant or radical, the other vowels are indicated by marks; thus ཀ ka, ཀི ki, ཀུ ku, ཀེ ke, ཀོ ko. The vowels i, e, and o are placed above consonants as diacritics, while the vowel u is placed underneath consonants.[2] Old Tibetan also used a reversed gigu (gigu 'verso'), whose meaning is uncertain. Written Tibetan does not distinguish long and short vowels, except in loanwords, especially those transcribed from Sanskrit.
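As a rough illustration, the vowel marks can be modelled as combining characters appended to the consonant. The code points below are the Unicode vowel signs for i, u, e, and o; the helper name `with_vowel` is hypothetical.

```python
import unicodedata

# Wylie vowel -> combining vowel sign. The inherent vowel "a" needs no mark.
VOWEL_SIGNS = {
    "a": "",
    "i": "\u0f72",  # written above the consonant
    "u": "\u0f74",  # written below the consonant
    "e": "\u0f7a",  # written above the consonant
    "o": "\u0f7c",  # written above the consonant
}

KA = "\u0f40"


def with_vowel(consonant: str, vowel: str) -> str:
    """Attach a vowel to a consonant (hypothetical helper)."""
    return consonant + VOWEL_SIGNS[vowel]


syllables = {v: with_vowel(KA, v) for v in "aiueo"}
for vowel, text in syllables.items():
    marks = [unicodedata.name(c) for c in text[1:]]
    print("k" + vowel, text, marks)
```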
In the Tibetan script, the syllables are written from left to right.[9] Syllables are separated by a tseg (་); since many Tibetan words are monosyllabic, this mark often functions almost as a space. Spaces are not used to divide words.
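Because the tseg delimits syllables, a text can be split into syllables by treating it as a separator. The Python sketch below assumes the tseg is encoded as U+0F0B and uses the word dbu-can as input; the function name `split_syllables` is hypothetical, and other punctuation is not handled.

```python
from typing import List

TSEG = "\u0f0b"  # TIBETAN MARK INTERSYLLABIC TSHEG


def split_syllables(text: str) -> List[str]:
    """Split Tibetan text into syllables at each tseg, dropping empties."""
    return [syllable for syllable in text.split(TSEG) if syllable]


# dbu-can: d + b + vowel sign u, tseg, c + n, tseg
uchen = "\u0f51\u0f56\u0f74" + TSEG + "\u0f45\u0f53" + TSEG

syllables = split_syllables(uchen)
print(len(syllables), syllables)
```

Splitting at the tseg yields syllables, not words; finding word boundaries requires a dictionary or a trained segmenter, since spaces are not used.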
Although some Tibetan dialects are tonal, the language had no tone at the time of the script's invention, and there are no dedicated glyphs for tone. However, because tones developed from segmental features, they can usually be predicted correctly from the archaic spelling of Tibetan words.
As in other Indic scripts, clustered consonants are often stacked vertically. Unfortunately, some fonts and applications do not support this behavior for Tibetan, so these examples may not display properly; you might have to download a font such as Tibetan Machine Uni.
Devanagari | IAST | Tibetan | Dependent vowel signs | Devanagari | IAST | Tibetan | Dependent vowel signs | |
---|---|---|---|---|---|---|---|---|
अ | a | ཨ | | औ | au | ཨཽ | ཽ | |
आ | ā | ཨཱ | ཱ | ऋ | ṛ | རྀ | ྲྀ | |
इ | i | ཨི | ི | ॠ | ṝ | རཱྀ | ཷ | |
ई | ī | ཨཱི | ཱི | ऌ | ḷ | ལྀ | ླྀ | |
उ | u | ཨུ | ུ | ॡ | ḹ | ལཱྀ | ཹ | |
ऊ | ū | ཨཱུ | ཱུ | अं | aṃ | ཨཾ | ཾ | |
ए | e | ཨེ | ེ | अँ | | ཨྃ | ྃ | |
ऐ | ai | ཨཻ | ཻ | अः | aḥ | ཨཿ | ཿ | |
ओ | o | ཨོ | ོ | | | | | |
Devanagari | IAST | Tibetan | Devanagari | IAST | Tibetan | |
---|---|---|---|---|---|---|
क | ka | ཀ | द | da | ད | |
ख | kha | ཁ | ध | dha | དྷ | |
ग | ga | ག | न | na | ན | |
घ | gha | གྷ | प | pa | པ | |
ङ | ṅa | ང | फ | pha | ཕ | |
च | ca | ཙ | ब | ba | བ | |
छ | cha | ཚ | भ | bha | བྷ | |
ज | ja | ཛ | म | ma | མ | |
झ | jha | ཛྷ | य | ya | ཡ | |
ञ | ña | ཉ | र | ra | ར | |
ट | ṭa | ཊ | ल | la | ལ | |
ठ | ṭha | ཋ | व | va | ཝ | |
ड | ḍa | ཌ | श | śa | ཤ | |
ढ | ḍha | ཌྷ | ष | ṣa | ཥ | |
ण | ṇa | ཎ | स | sa | ས | |
त | ta | ཏ | ह | ha | ཧ | |
थ | tha | ཐ | क्ष | kṣa | ཀྵ |
The Sanskrit "cerebral" (retroflex) consonants ट ठ ड ण ष (ṭa, ṭha, ḍa, ṇa, ṣa) are represented by reversing the letters ཏ ཐ ད ན ཤ (ta, tha, da, na, sha) to give ཊ ཋ ཌ ཎ ཥ (Ta, Tha, Da, Na, Sha).
By classical convention, च छ ज झ (ca cha ja jha) are transliterated as ཙ ཚ ཛ ཛྷ (tsa tsha dza dzha), respectively. Nowadays, ཅ ཆ ཇ ཇྷ (ca cha ja jha) can also be used.
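The two conventions for the Sanskrit palatals amount to two lookup tables, sketched below in Python. The table names and the function `palatal_to_tibetan` are hypothetical. The modern jha is assumed to be encoded as ja followed by subjoined ha, since Unicode has no single code point for it.

```python
# Devanagari ca, cha, ja, jha -> Tibetan, classical convention
# (tsa, tsha, dza, dzha).
CLASSIC = {
    "\u091a": "\u0f59",
    "\u091b": "\u0f5a",
    "\u091c": "\u0f5b",
    "\u091d": "\u0f5c",
}

# Devanagari ca, cha, ja, jha -> Tibetan, modern alternative
# (ca, cha, ja, jha).
MODERN = {
    "\u091a": "\u0f45",
    "\u091b": "\u0f46",
    "\u091c": "\u0f47",
    "\u091d": "\u0f47\u0fb7",  # ja + subjoined ha (assumed encoding)
}


def palatal_to_tibetan(letter: str, classic: bool = True) -> str:
    """Map a Devanagari palatal to Tibetan; other input is returned as-is."""
    table = CLASSIC if classic else MODERN
    return table.get(letter, letter)


for deva in CLASSIC:
    print(deva, palatal_to_tibetan(deva), palatal_to_tibetan(deva, classic=False))
```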
Tibetan was originally one of the scripts in the first version of the Unicode Standard in 1991, in the Unicode block U+1000–U+104F. However, in 1993, in version 1.1, it was removed (the code points it occupied were later used for the Burmese script in version 3.0). The Tibetan script was re-added in July 1996 with the release of version 2.0.
The Unicode block for Tibetan is U+0F00–U+0FFF. It includes letters, digits, and various punctuation marks and special symbols used in religious texts. Empty cells indicate unassigned code points:
Tibetan[1] Unicode.org chart (PDF) |
Code point | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F |
U+0F0x | ༀ | ༁ | ༂ | ༃ | ༄ | ༅ | ༆ | ༇ | ༈ | ༉ | ༊ | ་ | ༌ | ། | ༎ | ༏ |
U+0F1x | ༐ | ༑ | ༒ | ༓ | ༔ | ༕ | ༖ | ༗ | ༘ | ༙ | ༚ | ༛ | ༜ | ༝ | ༞ | ༟ |
U+0F2x | ༠ | ༡ | ༢ | ༣ | ༤ | ༥ | ༦ | ༧ | ༨ | ༩ | ༪ | ༫ | ༬ | ༭ | ༮ | ༯ |
U+0F3x | ༰ | ༱ | ༲ | ༳ | ༴ | ༵ | ༶ | ༷ | ༸ | ༹ | ༺ | ༻ | ༼ | ༽ | ༾ | ༿ |
U+0F4x | ཀ | ཁ | ག | གྷ | ང | ཅ | ཆ | ཇ | | ཉ | ཊ | ཋ | ཌ | ཌྷ | ཎ | ཏ |
U+0F5x | ཐ | ད | དྷ | ན | པ | ཕ | བ | བྷ | མ | ཙ | ཚ | ཛ | ཛྷ | ཝ | ཞ | ཟ |
U+0F6x | འ | ཡ | ར | ལ | ཤ | ཥ | ས | ཧ | ཨ | ཀྵ | ཪ | ཫ | ཬ | |||
U+0F7x | | ཱ | ི | ཱི | ུ | ཱུ | ྲྀ | ཷ | ླྀ | ཹ | ེ | ཻ | ོ | ཽ | ཾ | ཿ |
U+0F8x | ྀ | ཱྀ | ྂ | ྃ | ྄ | ྅ | ྆ | ྇ | ྈ | ྉ | ྊ | ྋ | ྌ | ྍ | ྎ | ྏ |
U+0F9x | ྐ | ྑ | ྒ | ྒྷ | ྔ | ྕ | ྖ | ྗ | | ྙ | ྚ | ྛ | ྜ | ྜྷ | ྞ | ྟ |
U+0FAx | ྠ | ྡ | ྡྷ | ྣ | ྤ | ྥ | ྦ | ྦྷ | ྨ | ྩ | ྪ | ྫ | ྫྷ | ྭ | ྮ | ྯ |
U+0FBx | ྰ | ྱ | ྲ | ླ | ྴ | ྵ | ྶ | ྷ | ྸ | ྐྵ | ྺ | ྻ | ྼ | | ྾ | ྿ |
U+0FCx | ࿀ | ࿁ | ࿂ | ࿃ | ࿄ | ࿅ | ࿆ | ࿇ | ࿈ | ࿉ | ࿊ | ࿋ | ࿌ | | ࿎ | ࿏ |
U+0FDx | ࿐ | ࿑ | ࿒ | ࿓ | ࿔ | ࿕ | ࿖ | ࿗ | ࿘ | ࿙ | ࿚ | |||||
U+0FEx | ||||||||||||||||
U+0FFx | ||||||||||||||||
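The block boundaries and the gaps in the chart can be checked programmatically. The Python sketch below uses the standard `unicodedata` module; the function names are hypothetical. The exact list of unassigned code points depends on the Unicode version bundled with the interpreter, so no full expected output is given.

```python
import unicodedata
from typing import List

TIBETAN_START, TIBETAN_END = 0x0F00, 0x0FFF


def in_tibetan_block(ch: str) -> bool:
    """Return True if the character lies in the Tibetan block."""
    return TIBETAN_START <= ord(ch) <= TIBETAN_END


def unassigned_in_block() -> List[int]:
    """List code points in the block that have no assigned character.

    General category "Cn" marks unassigned code points in the Unicode
    database shipped with the running Python version.
    """
    return [
        cp
        for cp in range(TIBETAN_START, TIBETAN_END + 1)
        if unicodedata.category(chr(cp)) == "Cn"
    ]


gaps = unassigned_in_block()
print(in_tibetan_block("\u0f40"))   # letter ka
print(in_tibetan_block("\u1000"))   # now a Burmese (Myanmar) letter
print([f"U+{cp:04X}" for cp in gaps[:5]])
```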